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Inspired by studies on the airports' network and the physical Internet, we propose a general 
model of weighted networks via an optimization principle. The topology of the optimal network 
turns out to be a spanning tree that minimizes a combination of topological and metric quantities. 
It is characterized by a strongly heterogeneous traffic, non-trivial correlations between distance and 
traffic and a broadly distributed centrality. A clear spatial hierarchical organization, with local 
hubs distributing traffic in smaller regions, emerges as a result of the optimization. Varying the 
parameters of the cost function, different classes of trees are recovered, including in particular the 
minimum spanning tree and the shortest path tree. These results suggest that a variational approach 
represents an alternative and possibly very meaningful path to the study of the structure of complex 
weighted networks. 



PACS numbers: 



'.75.-k, 



i. 75. He, 05.40 -a, 



'.75.Fb, 87.23.Ge 



Transportation and communication infrastructures 
such as the airports' network and the physical Internet 
are characterized by broad distributions of traffic Q, , 
betweenness centrality |3( and in some case also of de- 
gree Q. Strong, non- linear traffic-distance and traffic- 
connectivity correlations have also been reported Q, • 
Modelization attempts with ingredients such as random 
weights, dynamical rules or weight-topology coupled dy- 
namics (see eg. and refs. therein) have mainly fo- 
cused on growth processes and variational approaches 
have instead mainly been used only in practical problems 
by road traffic engineers H . Both the problem of opti- 
mal traffic |fj|] on a network and of optimal networks |jj 
have a long tradition in mathematics and physics. It 
is well known, for example that the laws that describe 
the flow of currents in a resistor network Q can be de- 
rived by minimizing the energy dissipated by the net- 
work |9(. On the other hand, optimal networks have 
shown to be relevant in the study of mammalians circula- 
tory systemjlfjj , food webs Jll T g eneral transportation 
networks |l2j. metabolic rates river networks 
and gas pipelines or train tracks |l5|. All these studies 
share the fact that the nodes of the network are em- 
bedded in a d-dimcnsional euclidean space which implies 
that the degree is almost always limited and the con- 
nections restricted to 'neighbours' only. A second broad 
class of optimal networks where spatial constraints are 
absent has been also recently investigated. It has been 
shown, for example, that optimization of both the aver- 
age shortest path and the total length can lead to small- 
world networks and more generally, degree corre- 
lations 0| or scale-free features can emerge from 
an optimization process. Cancho and Sole showed 
that the minimization of the average shortest path and 
the link density leads to a variety of networks including 



exponential-like graphs and scale- free networks. Guimcra 
et al. |20j studied networks with minimal search cost and 
found two classes of networks: star-like and homogeneous 
networks. Finally, Colizza et al. [ilj studied networks 
with the shortest route and the smallest congestion and 
showed that this interplay could lead to a variety of net- 
works when the number of links per node is changed. 

In this paper mostly inspired by studies on airports' 
networks 0, 0], we investigate the important case for 
which nodes are embedded in a 2-dimensional plane but 
links are not constrained (as for air routes) to connect 
'neighbours'. We propose a cost function that depends 
both on the length and the traffic carried by the links and 
show that the resulting optimal network is hierarchically 
organized in space and displays a complex traffic struc- 
ture. We consider a set of N points ('airports') randomly 
distributed in a square of unitary area and would like to 
build a network (air routes) that connects all points. The 
cost or weigth Wij associated to 'traveling' along a link 
is a function of both the length dij of the link and 
of the traffic Uj it carries. In the air-network analogy, 
the quantity represents the number of passengers on 
the link and is symmetric Uj = tji. To travel from a 
generic node iq to another generic node i p along a specific 
path {iq, ii, i%, . . . , ip-i, i p } the cost to be paid is the sum 
of the weights u>k,k+i associated to the links that com- 
pose the path, and when more than one path is available 
we assume that the most economical one is chosen 
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where the minimisation is over all paths p belonging in 
the set of paths V(io,i P ) going from iq to i p (w e is the 
weight of the edge e). If there is no path between a pair 
of nodes the corresponding cost will be taken as equal 
to infinity. This choice ensures that the optimal network 



will be connected. The global quantity £q we wish tc 
minimize is then the average cost to pay to travel from a 
generic node to another 

Our purpose is therefore to find the traffic {t*j} carried 
by the links and which minimises (J2J, with the only 
constraints that all tij > and that the total traffic 
T = J2i<j tij i s fixed- In this paper, we choose as 
the weight of a link e the ratio of its length to traffic: 
w e = d e /t e (with this choice, the value of T fixes the 
scale of traffic and does not affect the topology of the 
optimal network since a rescaling of the total ener gy b y 
a constant factor will not affect the minimization 22]). 
Although this choice is not the most general, it natu- 
rally verifies the expectation that the weight increases 
with dij and decreases with tij. This last condition can 
be easily understood in the case of transportation net- 
works and means that it is more economic to travel on 
links with a large traffic, reducing the effective distance — 
or the marginal cost — of the connection. We search for 
the minimum-realizing traffic using a zero temperature 
metropolis algorithm. The elementary move consists in 
transferring a random fraction of the traffic carried by 
a link to another one (the total traffic being fixed). We 
choose at random two links and and transfer 

weights between them according to 

tij ► tij ottjij (3) 

t\i jt ► t\i ji Qttij (4) 

where a is a uniform random number between zero and 
one. The sign of a is positive with probability p and neg- 
ative with probability 1 — p (if one of the links has a zero 
weight, the transfer can only be made in one direction, in 
other cases p = 0.5 ensures a quick convergence). If after 
the transfer the weight of a link is zero, the correspond- 
ing link is deleted. The minimum-cost path between two 
points is recalculated at any step with Djikstra's algo- 
rithm • We compute the energy difference A = £' — £q 
and only if it is negative the transfer is accepted. We 
test a number of order 0(N 2 ) of such transfers which 
converges to an optimal network which minimizes the 
energy £$. The initial topology is a complete graph with 
random weights on the links. As we show below the op- 
timal solution is characterized by a non trivial topology 
and spatial organization which results as the compromise 
of two opposing forces: the need for short routes and the 
traffic concentration on as few paths as possible. The 
interplay between topology and traffic naturally induces 
the observed correlations between degree, distance and 
traffic itself. 

Numerical simulations shows that the optimal network 
is a tree. A simple example supporting this finding is 
obtained by considering an isocele triangle ABC with 




FIG. 1: Different spanning trees obtained for different values 
of (fi, v) in Eq. © obtained for the same set of N = 1000 
nodes, (a) Minimum spanning tree obtained for (/i, v) = 
(0, 1). In this case the total distance is minimized, (b) Opti- 
mal traffic tree obtained for (fi, v) = (1/2, 1/2). In this case 
we have an interplay between centralization and minimum 
distance resulting in local hubs, (c) Minimum euclidean dis- 
tance tree obtained for (p., v) — (1, 1). In this case centrality 
dominates over distance and a 'star' structure emerges with a 
few dominant hubs, (d) Optimal betweenneess centrality tree 
obtained for (fi, v ) = (1,0). In this case we obtain the short- 
est path tree which has one star hub (for the sake of clarity, 
we omitted some links in this last figure). 

d(A, C) = d(B, C) = d and d(A, B) = d! , optimization 
leads to the values tAC = ^bc ~ T/2 and tAB ~ when 
d 3> d! . The minimum energy is thus (at leading order 
in d) £q « d/tAc + d/tsc ~ 4c?/T. When we remove 
the link BC and thus kill the loop, the traffic on AC 
becomes approximately twice higher, ie. tAc ~ T (and 
tsc ~ tAB ~ 0) but the minimum energy at leading 
order is £q rs IdjtAc ~ 2d/T which is lower than £ . This 
example shows that optimization reduces the number of 
links joining nodes in the same regions and increases the 
traffic on the remaining links. Loops between nodes in 
the same neighborhood become then redundant. 

The optimal network being a tree enormously simpli- 
fies the computation of the energy. Since only a single 
path exists between any two nodes in a tree, the energy 
(0) can be rewritten as £q = X^eer where b e is the 
edge-betweenness |28l and counts the number of times 
that e belongs to the shortest path between two nodes. 
The optimal traffic (with the same constraints as above) 
is given by t e — T\Jb e d e j ^ e \/b e d e and the topology of 
the 'optimal traffic tree' (OTT) can then be obtained by 
minimizing 

£ = v/mT (5) 

The minimal configuration can now be searched by 
rewiring links. Replacing link by {i,f) modifies 

only the centralities along the path between j and j' 
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which implies that our calculation has a complexity of 
order O(N) and allows computations over very large net- 
works (the same algorithm was used in the context of 
river networks 24]). We expect to obtain something very 
different from the classical (Euclidean-) minimum span- 
ning tree (MST) Q since £ involves a combination of 
metric (the distance) and topological (the betweenncss) 
quantities. The expression Eq. J^J) suggests an interesting 
generalization given by the optimization of 
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e6T 



(6) 



where /i and v control the relative importance of dis- 
tance against topology as measured by centrality. Fig. (JTJ 
shows examples of spanning trees obtained for different 
values of (/i, v). For (/i, v) = (0, 1) one obtains the Eu- 
clidean Minimum Spanning Tree [Fig. {%K )] which can 
also be obtained by minimizing the total weight w e 
and gives a traffic t e = T\fd~ e j \fde- F° r (Mi v ) — 
(1/2, 1/2) we obtain the OTT [Fig. Qy)] which displays 
an interesting interplay between distance and shortest 
path minimization (see below). For (fJ>,v) — (1,1), the 
energy is proportional to the average shortest weighted 
path (with weights equal to euclidean distance [Fig. QJ:)]. 
When (/i, v) = (1, 0), the energy JBJ is proportional to the 
average betweenness centrality and therefore to the av- 
erage shortest path J2 e be oc I. The tree (1,0) shown in 
Fig. QJl) is thus the shortest path tree (SPT) with an ar- 
bitrary "star-like" hub (a small non zero value of v would 
select as the star the closest node to the gravity center). 
The minimization of Eq. jnj) thus provides a natural in- 
terpolation between the MST and the SPT a problem 
which was addressed in previous studies 25]. The de- 
gree distribution for all cases considered above [with the 
possible exception (/i, v) = (1, 1) — a complete inspection 
of the plane (//, v) is left for future studies] is not broad, 
possibly as a consequence of spatial constraints. In par- 
ticular, the degree distribution for the OTT is well fitted 
by an exponential function. 

It has been shown that trees can be classified in 'univer- 
sality classes' [2(| |27j according to the size distribution 
of the two parts in which a tree can be divided by re- 
moving a link (or the sub-basins areas distribution in the 
language of river network) . We define Ai and Aj as the 
sizes of the two parts in which a generic tree is divided by 



The betweeness 6y of link (i,j) 
= l[MN - A) + Aj(N - Aj)], 



removing the link (i, j) 
can be written as bij 
and the distributions of A's and b's can be easily derived 
one from the other. It is therefore not surprising that 
the same exponent S characterizes both P(A) ~ A~ s 
and P(b) . While we obtain the value 6 — 4/3 for the 
MST 26], for the OTT we obtain (Fig. 0) an exponent 
6 ~ 2, a value also obtained for trees grown with preferen- 
tial attachment mechanism [i^ (see also jl^] for a sup- 
porting argument). Interestingly, most real- world net- 
works are also described by this value 6 ~ 2 3]. The 
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FIG. 2: Betweenness centrality distribution for the MST and 
for the OTT. The lines are power law fits and give for the 
MST the theoretical result 5 = 4/3 and for the OTT the 
value S ~ 2.0 (N = 10 4 , 100 configurations). 




FIG. 3: Hierarchical organization emerging for the optimal 
traffic tree (ji,v) = (1/2,1/2) [N = 1000 nodes). Longer 
links lead to regional hubs which in turn connect to smaller 
hubs distributing traffic in smaller regions. 



OTT thus tends to have a more uniform centrality with 
respect to the MST |3(| , with important consequences on 
the vulnerability of the network since there is no clearly 
designated 'Achille's heel' for the OTT. 

We now investigate the traffic properties of the OTT 
and we find that the traffic is scaling as iy ~ djj with 
r w 1.5 showing that large traffic is carried over large 
distance and is then dispatched on smaller hubs that 
distribute it on still smaller regions. Despite the lim- 
ited range of degrees, we also observe for the strength 
Si = j Uj a superlinear behavior with the degree. This 
result demonstrates the existence of degree-traffic cor- 
relations as observed for the airport network for exam- 
ple Q can emerge from a global optimization process. 
The spatial properties of the OTT are also remarkable 
and displays (Fig. |3J) a hierarchical spatial organization 
where long links connect regional hubs, that, in turn are 
connected to sub-regional hubs, etc. This hierarchical 
structure can be probed by measuring the average eu- 
clidean distance between nodes belonging to the largest 
cluster obtained by deleting recursively the longest link. 
For the OTT (Fig. , we observe a decrease of the region 
size, demonstrating that longer links connect smaller re- 
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FIG. 4: Average euclidean size of the largest cluster remain- 
ing after deleting links ranked according to their length (in 
decreasing order). This plot is obtained for one typical con- 
figuration of size N = 1000 and a square of area set to one. 
The decrease observed for the OTT is consistent with a hi- 
erarchical spatial organization as it is visually evident from 
Fig. El 



gions, a feature absent in non-hierarchical networks such 
as the MST, the SPT or the random tree (Fig. HJ. 

In summary, we showed that the emergence of com- 
plex structure in traffic organization could be explained 
by an optimization principle. In particular, strong cor- 
relations between distance and traffic arise naturally as 
a consequence of optimizing the average weighted short- 
est path. In the optimal network, long-range links carry 
large traffic and connect regional hubs dispatching traffic 
on a smaller scale ensuring an efficient global distribu- 
tion. These results suggest that the organization of the 
traffic on complex networks and more generally archi- 
tecture of weighted networks could in part result from 
an evolutionary process. The optimal networks obtained 
here are trees, but some transportation networks contain 
loops which reflect the fact that other ingredients are 
needed in order to describe them. Our results however 
suggest that some transportation networks could possibly 
be seen as the superposition of many trees. It would be 
interesting to check if this is the case for the world-wide 
airport network which results from the superposition of 
individual airline company networks which are probably 
close to trees. Finally, this study led us to propose a 
generalization of the usual minimum spanning tree by 
introducing the centrality and allows one to interpolate 
from the MST to the shortest path tree. This general- 
ization however deserves further studies and raises inter- 
esting question such as the crossover from different tree 
universality classes. 
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